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We present quasiparticle self-consistent GW (QSGVF) calculations of semimetallic bulk Bi. We 
go beyond the conventional QSG1F method by including the spin-orbit coupling throughout the 
self-consistency cycle. This approach improves the description of the electron and the hole pock¬ 
ets considerably with respect to standard density functional theory (DFT), leading to excellent 
agreement with experiment. We employ this relativistic QSGIF approach to conduct a study of 
the semimetal-to-semiconductor and the trivial-to-topological transitions that Bi experiences under 
strain. DFT predicts that an unphysically large strain is needed for such transitions. We show, by 
means of the relativistic QSGIT description of the electronic structure, that an in-plane tensile strain 
of only 0.3% and a compressive strain of 0.4% are sufficient to cause the semimetal-to-semiconductor 
and the trivial-to-topological phase transitions, respectively. Thus, the required strain moves into 
a regime that is likely to be realizable in experiment, which opens up the possibility to explore 
bulklike topological behavior of pure Bi. 

PACS numbers: 71.10.-w,71.55.Ak,71.15.Mb,71.20.-b,71.70.Ej 


Bismuth exhibits a series of peculiarities that has made 
it the subject of experimental and theoretical interest 
for decades. The very low density of carriers with high 
carrier mobility and a very long mean free path make 
bismuth very interesting for electronic transport studies. 
With the recent introduction of the classification of solids 
according to topology classes of the electronic structure^ 
Bi has again moved into the limelight of current discus¬ 
sions. With a lattice structure that bears similarities to 
graphene and a very large spin-orbit interaction, it has 
all the potential to be a topological semimetal or semi¬ 
conductor. From the theoretical point of view, the large 
spin-orbit coupling (SOC), the semimetallic character, 
and the very small local direct band gap at the L point 
of the Brillouin zone make ab initio calculations of the 
electronic structure of Bi very challenging. 

As for its topological classification, some important 
aspects are not yet fully understood. It is widely ac¬ 
cepted that bulk Bi is topologically trivial in contrast to 
Sb. This is in agreement with angle-resolved photoemis¬ 
sion spectroscopy (ARPES) measurements of Bi2~— and 
Bii_ x Sb x alloys^— as well as transport studies, in which 
a semimetal-to-semiconductor (SMSC)£ and a topolog¬ 
ical transition^ are observed as the Sb concentration is 
varied from the topologically trivial Bi to the topological 
semimetal Sb, resulting in three-dimensional (3D) topo¬ 
logical insulators for some of the alloys. However, contro¬ 
versies concerning the topological properties of bismuth 
persist. For example, it has recently been claimed^ on 
the basis of ARPES measurements that bulk bismuth is 
topologically nontrivial, a claim that is in conflict with 
most previous findings including ab initio calculations 
based on density functional theory (DFT). The authors 
of Ref. 0 attributed these discrepancies to the system¬ 
atic underestimation of the band gaps obtained in the 
local-density (LDA) or generalized gradient (GGA) ap¬ 


proximations of DFT. Given the very small experimental 
value (11-15 meV— ~— ) for the direct band gap at the L 
point (Eg), whose “sign” (order of the states) controls the 
topological nature of Bi, it is very conceivable that DFT 
might incorrectly predict the sign of the band gap, and 
not just its magnitude. Our parity analysis within LDA 
is in agreement with results in the literature (see, e.g., 
Refs. ll6H18h and confirms the trivial character of Bi. But 
surprisingly, LDA presents an unusual overestimation of 
the band gap at L (86 meV; see Table HI , which is an ef¬ 
fect usually characteristic of inverted gaps, as with those 
in topological insulators : -™ : ™ The gap at L is indeed in¬ 
verted, as can be seen by varying the SOC strength from 
zero to 100%: the two states at L exchange order for a 
certain SOC strength. But the gap at T is inverted as 
well. There is, hence, an even number of inverted gaps 
at the time-reversal invariant momenta, giving rise to a 
topologically trivial value of the Z 2 invariant: 21 : 22 The 
GGA approximation hardly improves the LDA results 
(Table Q}. The poor description of the band gaps casts 
doubt on the LDA and GGA studies of the trivial-to- 
topological (TT)22 i 24 or the SMSCS^ transitions of Bi, 
as well as the TT transition in Bii_ x Sb x alloys, as these 
studies critically depend on the sign and the value of the 
band gap at L. 

The cause of the band-gap problem of DFT is well 
known. Strictly, the single-particle energies cannot be 
interpreted as quasiparticle energies of the interacting 
electron system. The GW approximation^ for the elec¬ 
tronic self-energy, applied as a one-shot correction to 
DFT or, more rarely, in the quasiparticle self-consistent 
GW (QSGW) approach^ remedies the aforementioned 
problem and is often used to correct the band gaps ob¬ 
tained from LDA or GGA. In most cases it increases 
the band gap leading to a much better agreement with 
experiments^ However, we show here (c/. Table QJ that 
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TABLE I. Values of the direct band gap at L (E s ) and the 
indirect band gap (Eo) [see Fig. EJb)] for bulk bismuth (in 
meV) calculated within LDA, GGA, GW, and QSGVF, com¬ 
pared with experimental results. 


LDA GGA GW QSGW 

Expt. 

direct gap at L (A g ) 86 

74 

32 

13 

11 - 15^ 

indirect gap T-L (Eo) —105 

-72 

-66 

-33 

-32 - -3!A 


a 11-0 O, 13.6 [11, 15.0 [I3and|T3, 15.3 0 
b -32.0 H3, -36.0 0, -38.0 0, -38.2 0 ,-38.5 0 


iii the case of bismuth, the GW quasiparticle correction is 
negative, i.e., it correctly reduces the band gap to 32 meV 
in one-shot GW (which corresponds to the first itera¬ 
tion of QSGLF) and to 13 meV within the self-consistent 
QSGLF scheme. The latter value lies in the range of 
experimental values. For these calculations, we have de¬ 
vised and implemented a method that combines the SOC 
with the many-body renormalization in a consistent way 
(see below). The sign of the band gap is not changed by 
the quasiparticle correction, i.e., bismuth remains triv¬ 
ial, in accordance with the vast majority of experimental 
and theoretical studies. However, the phase boundary at 
which bismuth becomes topologically nontrivial is consid¬ 
erably closer in QSGLF than in LDA. We will show that 
only a relatively modest in-plane tensile strain of 0.3% 
and compressive strain of 0.4% are sufficient to cause 
the SMSC and TT phase transition, respectively, which 
opens up the possibility to explore the 3D topological 
properties of pure Bi. 

Bulk bismuth crystallizes in the A7 rhombohedral 
structure with R3m space group and two atoms per 
unit cell. The cell and the atomic positions are de¬ 
termined by the internal parameter u , the rhombohe¬ 
dral lattice parameter a r ho, and the rhombohedral an¬ 
gle a r ho [see Fig. Oja)]. The layered structure consists 
of Bi bilayers separated by a van der Waals gap. For 
the undistorted reference lattice, we employ experimental 
parameters from Ref. [31: a r ho=4.7458 A, a r h o =57.230°, 
u=0. 23389. Equivalently, the structure can be described 
as hexagonal with in-plane and out-of-plane lattice con¬ 
stants ao=4.5460 A and co=11.862 A (co/ao=2.6093). 
These lattice parameters were measured at 298 K. The 
gray vertical areas in Fig.[3]show the range of experimen¬ 
tal values of c/a for temperatures between 4 and 298 K— 
All of our calculations are carried out with the DFT code 
FLEUR— and the GW code SPEX^ realizations in the all¬ 
electron full-potential linearized augmented-plane-wave 
(FLAPW) formalism. The convergence parameters of the 
DFT, GW, and QSGLF calculations are given in the Ap¬ 
pendix. To include relativistic effects in the DFT calcula¬ 
tions, we employ the scalar-relativistio^ approximation 
inside the muffin-tin spheres, and the SOC is included in 
a self-consistent manner.~ 

Most of the one-shot GW calculations that include 
SOC and all of the QSGLF ones published so far, employ 
the SOC as an a posteriori correction after the quasi- 
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FIG. 1. (Color online) (a) Rhombohedral (green) unit cell and 
hexagonal crystal structure of bulk Bi. The blue and purple 
atoms distinguish the atoms in the two layers of each bilayer, 
(b) Bulk Brillouin zone of Bi expressed in the rhombohedral 
unit cell. 


particle correction to DFT has been carried out with¬ 
out SOC. This is, of course, an approximation. For ex¬ 
ample, many-body renormalization effects of spin-orbit 
split bands are then not taken into account (see Ref. [IB 
for a detailed discussion). In our relativistic calculations, 
the SOC is already incorporated in the noninteracting 
system that serves as the starting point for the quasi¬ 
particle calculation i 36 ’ 37 The one-shot GW self-energy 
thus acquires terms that couple the two spin channels, 
enabling a many-body renormalization of the SOC it¬ 
self. In this work, we propose to extend this principle to 
the self-consistent QSGLF calculations, so that the self¬ 
energy contains spin off-diagonal blocks that it inherits 
from the SOC throughout the whole self-consistent pro¬ 
cess. (In analogy to the notation G soc IF soc of Ref. H3, 
one could denote this approach by QSG soc LF soc , but 
for simplicity we will simply write GW and QSGLF in the 
following.) Our results show that this treatment is cru¬ 
cial for Bi. The a posteriori correction mentioned above, 
on the other hand, leads to unphysical results for Bi (see 
Appendix), which then appears as a topological insulator 
with large band gaps (83 and 259 meV in one-shot GW 
and QSGLF, respectively) instead of a trivial semimetal. 

In addition to the value of the band gap at the L point, 
the properties of Bi are also determined by the states at 
the symmetry point T. In particular, the semimetallic 
character of bismuth is caused by the overlap between 
the highest valence band at T and the lowest conduction 
band at L [see Fig. |2jb)] . This creates three very small 
electron pockets (at the three L points in the Brillouin 
zone) and one hole pocket at T. Several attempts to de¬ 
scribe these pockets theoretically have been published in 
recent years. For a review and comparison of them see, 
e.g., Table II in Ref. |T3- The approaches used include 
LDA and GGA , 17 ’ 38 ' 39 tight-binding models,— and em¬ 
pirical pseudopotential calculations^ but no GW results 
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FIG. 2. (Color online) LDA, GW, and QSGFF band struc¬ 
tures of bulk Bi. The inset in (b) shows the electron and hole 
pockets together with the E g and Eo gaps corresponding to 
the LDA bands. 


have been published so far. The results from the models 
and DFT calculations have not been satisfactory: the ab¬ 
solute value of the overlap, or in other words the indirect 
band gap Eo, came out too large in general compared to 
experiment. 

Our values of Eq are shown in Table fl] Both the LDA 
and GGA overestimate indeed this indirect band gap, 
in line with previous publications. While the one-shot 
GW approach does correct Eo in the right direction, it 
uses LDA as a starting point and, thus, inherits some 
of this overestimation. The self-consistent procedure of 
QSGW, on the other hand, makes the result independent 
of the starting point and moves Eq inside the range of 
experimental values. 

Figure [2] shows the band structure of bulk bismuth 
obtained with LDA, GW, and QSGW along the path 
connecting the high-symmetry k points represented in 
Fig. [Hb). Whereas the overall shape of the band struc¬ 
ture remains basically unaffected by the many-body cor¬ 
rections, there are important changes in the details. In 
particular, while in most parts the valence and conduc¬ 
tion bands are shifted apart by the many-body correc¬ 
tions as commonly seen in GW calculations, the changes 
of the dispersion in the vicinity of the Fermi energy are 
such that the absolute values of E g and E 0 get smaller 
and reach values of 13 and —33 meV, respectively. As has 
already been discussed before ) 19 ’ 20 ! 36 ’ 37 this is the result 
of a delicate interplay between the SOC and the many- 
body renormalization described by the GW approxima¬ 
tion. We conclude that the present relativistic QSGW 
is the method of choice to study the band structure of 


Bi and its topological character as well as the TT and 
SMSC transitions that Bi experiences under strain. 

A TT transition was indeed recently observed exper¬ 
imentally for strained Bi(lll) ultrathin films^ and a 
similar transition was found for strained Bi nanowires 
elongated about 2In the case of thin films, a strain 
can be introduced naturally by a lattice mismatch be¬ 
tween film and substrate. To simulate the strain in our 
calculations, we apply a volume-conserving distortion of 
the structure by varying the c/a ratio, and keeping the 
internal parameter u constant. A compression in the 
c direction is accompanied by a corresponding increase 
of a, and vice versa. This is equivalent to varying the 
rhombohedral angle a r h 0) see Fig. [0(a). As the c/a ra¬ 
tio increases, the nearest-neighbor distance reduces and 
the van der Waals gap expands. Both effects lead to 
a strengthening of the in-plane interactions, while the 
interbilayer coupling weakens. The altered crystal field 
reduces the band gap at L until it reaches a zero value at 
a critical c/a ratio and becomes negative. This undoes 
the band inversion and causes a sign change in the cal¬ 
culation of the Z 2 topological invariants; 21 ’ 22 which, for 
systems with inversion symmetry, derive from the pari¬ 
ties of the occupied states at the time-reversal invariant 
momenta. The parity of a wave function remains un¬ 
changed upon applying the GW quasiparticle correction. 
The Z 2 topological invariants can, thus, only change if 
there is an interchange of valence and conduction states 
with respect to LDA. A topological-to-trivial transition 
in the topological insulator Bi 2 Se 3 as a result of strain 
has also been discussed in the literature. It has been 
argue d 42 ’ 43 that tensile out-of-plane strain reduces the 
spin-orbit strength leading to a trivial phase. We do not 
find such an effect for bulk Bi: no substantial increase 
or decrease in the spin-orbit strength is observed upon 
application of strain. 

Figure ED a) shows the direct band gap as a function of 
the c/a ratio with the range of equilibrium experimental 
values of bulk Bi represented as gray shaded areas. Posi¬ 
tive values of the gap denote a trivial phase, whereas neg¬ 
ative values correspond to a topologically nontrivial case. 
In addition to the semimetallic state (solid symbols) in 
Fig. (3](a), we also observe a transition to a semiconduct¬ 
ing state (open symbols) for tensile in-plane strain. This 
SMSC transition can also be seen in Fig. Etc) showing 
the variation of the indirect band gap, which changes 
sign and becomes positive at the point of transition. 

At the SMSC transition, the GW and QSGW curves 
exhibit abnormal behavior; there is a jump (dashed lines) 
where we expect the curve to be continuous. An analo¬ 
gous steplike behavior can be seen in the QSGW curve 
at the TT transition. As shown in the Appendix, these 
anomalies are an artifact of the k-point convergence: 
The steps get systematically smaller when one improves 
the k-point sampling, and convergence is achieved much 
more readily in the trivial semimetallic phase than in the 
other two phases, so that we can take the points be¬ 
tween the transitions in Fig. [3] as converged, and the 
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FIG. 3. (Color online) (a) LDA, GGA, GW, and QSGkF di¬ 
rect band gap E s of bismuth at L for different strains (vary¬ 
ing the ratio c/a and keeping the volume constant), (b) Three 
electronic phases are found for Bi as a function of the ratio c/a 
within all the approaches, (c) LDA, GGA, GW, and QSGkF 
indirect band gap Eo of bismuth for different values of c/a. 
On the upper x axis of (a) and (c), we show the percentage 
of in-plane strain. The range of experimental values of c/a,— 
the experimental gap at L,—as well as the experimental 
indirect ga p 13 ’ 15 ’ 29 ’ 30 are represented as gray shaded areas in 
(a) and (c). Small differences to the QSGFF values in Table [I] 
are due to the use of a reduced k-point set; see Appendix. 


fitted curves (solid lines) show the expected behavior 
in the sampling limit. We have employed a quadratic 
function for the direct gap and a linear function for 
the indirect gap as a function of c/a, which for QSGkF 
give E s = —1226.28(c/a) 2 + 5939.05(c/a) - 7131.51 and 
E 0 = -1827.57(c/a) + 4717.26 in meV. 

Figure [3|b) shows the three electronic phases that we 
can distinguish in bulk Bi under strain: a trivial semicon¬ 
ducting phase, a trivial semimetallic phase, and a topo¬ 
logical semimetallic phase. The overlap between the elec¬ 
tron and hole pockets, or the indirect gap, is caused by 


TABLE II. Values of the lattice parameters a and c and the 
ratio a/c for which (a) the TT and (b) the SMSC transitions 
occur for the different theoretical approaches. The values in 
brackets indicate the compressive (—) or tensile (+) strain in 
percentage. The difference in the total energies (AEtot) 
between the structure for which the transition occurs and 
the unperturbed reference lattice has been calculated within 
LDA. Note that we used ao=4.5460 A, co=11.862 A, and 
co/ao=2.6093^i for the undistorted lattice. 



a t c t c t /a t 

AFtot 


(A) (A) 

(meV) 

(a) TT: 

LDA 

4.461(—1.9%) 12.315 2.761(+5.8%) 

15.8 

GGA 

4.475(—1.6%) 12.238 2.735(+4.8%) 

11.7 

GW 

4.513(—0.7%) 12.036 2.667(+2.2%) 

4.1 

QSGkF 

4.527(—0.4%) 11.960 2.642(+1.2%) 

1.9 

(b) SMSC: 

LDA 

4.579(+0.7%) 11.690 2.553(-2.2%) 

0.5 

GGA 

4.580(+0.7%) 11.685 2.551(-2.2%) 

0.6 

GW 

4.569(+0.5%) 11.744 2.570(-1.5%) 

0.2 

QSGkF 

4.562(+0.3%) 11.776 2.581(-1.1%) 

0.1 


the distortion of the cell with respect to a cubic structure. 
It seems, therefore, that bismuth cannot be in a topologi¬ 
cal insulating phase because the stress necessary to make 
it topological distorts the cell rhombohedrally, distancing 
it even further from the cubic symmetry so that Bi be¬ 
comes more and more metallic. Figure [3] demonstrates 
that, whereas the two transitions are qualitatively de¬ 
scribed correctly by the four approaches, the LDA and 
GGA predict a large critical strain that would be very 
difficult to achieve experimentally. In particular, the TT 
transition is predicted to occur at a critical in-plane com¬ 
pressive strain of 1.9% and 1.6%, respectively (see Ta¬ 
ble DU). On the contrary, the strain needed to cause the 
transition is significantly smaller in GW (0.7%) and, in 
particular, in QSGkF (0.4%). In other words, taking into 
account structural relaxation, significantly thicker sam¬ 
ples could be grown under the relatively modest critical 
strain that is predicted by QSGkF than under that pre¬ 
dicted by DFT. 

In conclusion, we propose a QSGkF scheme in which 
the SOC is included throughout the whole self-consistent 
process. Our results show that such relativistic QSGkF 
calculations are the right ab initio approach to determine 
the electronic structure and the topological classification 
of bismuth. They predict its two most significant and 
delicate electronic properties in quantitative agreement 
with experiments: The direct gap at L, which determines 
whether Bi is topological or trivial, and the indirect gap 
between L and T, which describes the overlap between 
the electron and hole pockets and thus determines the 
semimetallic character of Bi. In response to a recent 
controversy^ we conclude the following from our cal¬ 
culations: bulk Bi is a trivial semimetal. 

We showed that Bi can undergo electronic phase tran¬ 
sitions under rather small lattice strain: a trivial SMSC 
transition under 0.3% in-plane tensile strain and a TT 
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semimetal transition under 0.4% compressive strain. In 
contrast to LDA and GGA, the critical strains predicted 
by QSGW are attainable by experiments. The pre¬ 
sented results motivate additional experimental efforts 
to prepare Bi as a topological insulator, by opening band 
gaps in slightly strained Bi. For example, the small 
strain needed to cause the TT transition concluded by 
this work, together with the SMSC transition driven by 
quantum-size effects in relatively thick films reveals the 
potential to observe a topological insulating behavior in 
bulk-like films of pure Bi. In fact, first experimental evi¬ 
dences of 3D-like topological thin films of Bi are currently 
under discussion^ 

The present results demonstrate that LDA and GGA 
lack the quantitative accuracy to predict the critical 
strain for which the electronic phase transitions occur. 
We speculate that standard DFT also yields wrong pre¬ 
dictions for the critical concentration at which Bi ex¬ 
periences a TT transition upon alloying with other el¬ 
ements (such as in Bii-^Sb^ alloys*^”— ^) or for the 
critical thickness at which a SMSC transition caused by 
quantum-size effects takes place. Bi is one example, but 
we believe it is much more general. Other examples could 
include the very large strains necessary to induce topo¬ 
logical phase transitions in B^Se; , 42 ' 43 and in TlBiS 2 and 
TlSbS 2 It would thus be desirable to reinvestigate the 
critical points for these transitions within the relativistic 
QSGIF method proposed in this work. 
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APPENDIX 

The calculations are carried out with the DFT code 
FLEURS and the GW code spex^ which are based on 
the all-electron FLAPW formalism. For the valence elec¬ 
trons, space is partitioned into spherical regions around 
the atoms (muffin-tin spheres) and interstitial region be¬ 
tween the spheres. We use an angular momentum cutoff 
Imax = 10 in the muffin-tin spheres and a plane-wave 
cutoff of 4.5 bohr” 1 in the interstitial region. For the 
DFT calculations, we employ either the Perdew-Zunger— 
parametrization of the LDA exchange-correlation func¬ 
tional or the Perdew-Burke-Ernzerho^ parametrization 
of the GGA. The one-shot GW calculations are always 


performed using the LDA mean-held system as a starting 
point. 

Due to the very small band gap of bulk Bi (11 to 
15 meV for the direct one^ ” 13 ’ 15 and —32 to —39 for 
the indirect on o 13 i 15 i 29 i 30 ), all computational parameters 
entering the GW and QSGW calculations are chosen af¬ 
ter a systematic study in order to obtain thoroughly con¬ 
verged results. An angular momentum cutoff of l = 5 and 
a linear momentum cutoff of 2.9 bohr” 1 are employed to 
construct the mixed product basis ; 33 ' 49 used to represent 
the dielectric matrix and the screened interaction. 

The one-shot GW and self-consistent QSGW results 
for the undistorted structure are obtained with a 6 x 6 x 6 
k-point mesh sampling the Brillouin zone (Table [T] and 
Fig. [5J. Fortunately, we found that the QSGW results 
converge faster with respect to the number of k points 
than the results of one-shot GW so that we could afford 
to reduce the k-point mesh to 4x4x4 for the QSGW 
calculations of bismuth under strain. For more details 
about the k-point convergence, see below. 

To compute the Green function and the polarization 
function, 500 bands are used. This corresponds to ap¬ 
proximately 130 eV above the Fermi energy. The com¬ 
plete fifth shell of Bi is treated as valence states by the 
use of local orbitals. In this way, the contribution of 
the 5s5p5d states to the Green function and the elec¬ 
tronic screening is fully taken into account. Despite their 
low energetic position, inclusion of the 5s and 5 p states 
effects a change of 5 meV in the band gap at L. To de¬ 
scribe high-lying states accurately and to avoid lineariza¬ 
tion errors*^— we complement the basis for the valence 
electrons for each atom by two local orbitals per angular 
momentum up to l = 3 with energy parameters far up in 
the unoccupied states. For the interpolation of the band 
structures in Fig. [2j maximally localized Wannier func¬ 
tions obtained by the WANNIER.90 library^ and a 6x6x6 
k-point mesh were employed. 

We employ the Dirac equation for the core electrons 
(up to and including the fourth shell) so that relativistic 
effects are fully accounted for. The core states are thus 
represented by four-component spinors. This represen¬ 
tation is retained for the evaluation of the core-valence 
contribution to the exchange self-energy. (Using aver¬ 
aged core states that are represented in terms of non- 
relativistic Im spherical harmonics instead of the jm 3 
spinors introduces an error of 4 and 7 meV in the direct 
band calculated with one-shot GW and QSGW, respec¬ 
tively.) 

As a semimetal, bismuth exhibits metallic screening, 
which is described technically by the so-called Drude 
term in the screened interaction W. This term stems 
from virtual intraband transitions across the Fermi sur¬ 
face. It can be formulated in a functional form being pro¬ 
portional to the square of the plasma frequency, which 
in turn is evaluated by an integration over the Fermi 
surface. The Drude term can be treated analytically^ 
and, as long as the Fermi surface is sufficiently big, it 
normally does not pose any numerical problem. How- 
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FIG. 4. (Color online) Convergence of the QSGW direct band 
gap at the L point: without SOC (circles), with SOC added a 
posteriori (triangles), and with the relativistic approach used 
in the main text (squares). Iteration 0, 1, and 10, correspond 
to the LDA, one-shot GW, and QSGIF values, respectively. 


ever, bismuth has a very small Fermi surface due to the 
tiny electron and hole pockets, which eventually leads to 
a very sharp “Drude peak” in the GW self-energy, im¬ 
peding a straight-forward numerical solution of the non¬ 
linear quasiparticle equation. One could also say that, 
while the Drude term is actually treated correctly, it 
gets too much weight because of the finite k-point mesh. 
Therefore, we neglect the Drude term in our calculations 
and, instead, simply scale the head element of W(k, w) 
in the limit k —> 0 to enforce metallic screening. This 
hardly changes W, and we have found that it leads to 
a very favorable k-point convergence; see below. (In the 
limit of dense k-point sets, the treatment of the Drude 
term, which affects only a single k point, i.e., the T point, 
becomes immaterial.) 

In Fig. [Tj we show the direct band gap at the L point for 
each iteration in the QSGW self-consistency procedure, 
in particular, we distinguish between calculations (i) 
without SOC, (ii) with spin-orbit coupling added a poste¬ 
riori to a result obtained without SOC (QSGfF+SOC), 
which is a correction scheme used in most GW calcula¬ 
tions so far, and (iii) the relativistic approach used in the 
present work in which spin-orbit coupling is included in 
the calculation of both G and W throughout the whole 
self-consistent procedure. (In an earlier publication, 3 ® we 
have used the notation G soc W soc for the corresponding 
one-shot calculation. By analogy, one could use the no¬ 
tation QSG 800 !!^ 00 for the present self-consistent ap¬ 
proach.) The a posteriori SOC correction leads to un- 
physical results for Bi, which appears as a topological in¬ 
sulator instead of a trivial semimetal. We attribute the 
lack of published GW results of bulk Bi to the crucial 


and delicate treatment of SOC, but also to the difficul¬ 
ties encountered with the semimetallicity of the system 
exhibiting very tiny electron and hole pockets (leading to 
a very sharp Drude peak in the self-energy, as discussed 
above) and to the smallness of the gaps requiring the 
calculations to be converged to within a few meV. 

We have also performed relativistic QSGWo calcula¬ 
tions, i.e., we keep the W at the LDA level while G 
is updated self-consistently. The converged QSGWo re¬ 
sult for the band gap at L differs only by 0.3 meV from 
the QSGW one. This indicates that the screening ob¬ 
tained with LDA is a good approximation and that self- 
consistency induces changes mainly in G. Finally, to 
quantify the effects of dynamics, we have also performed 
self-consistent calculations within the static Coulomb- 
hole screened-exchange (COHSEX)2& approximation to 
the GW self-energy (including SOC). In this approxima¬ 
tion, the changes are more pronounced. With respect 
to QSGW, the direct gap at L is reduced by 12 meV, 
nearly making the system topological, and the indirect 
band gap shrinks to -4 meV, nearly making the system 
semiconducting. 

To explain the discontinuities in Figs. [3](a) and [Bjc) 
and to prove their origin as a k-point convergence issue, 
we have performed GW and QSGW calculations with in¬ 
creasing number of k points (Fig. [5j. To allow for these 
calculations, we had to reduce some of the parameters in 
the calculation such as the plane-wave cutoff in the inter¬ 
stitial region for the LDA calculations (now 4.0 bohr -1 ), 
and both the angular momentum cutoff (l = 4) and the 
linear momentum cutoff (2.6 bohr -1 ) to construct the 
mixed product basis for the GW calculations. We also 
reduced the number of bands for the GW calculations to 
100 and only 5 d local orbitals were used in the calcula¬ 
tions. This obviously reduces the accuracy of the band 
gaps obtained—and therefore they differ from those of 
the main text— but it is not our intention in this ap¬ 
pendix to analyze the quantitative results, only the qual¬ 
itative behavior due to the k-point convergence. 

From Fig. [5J we can conclude the following about the 
k-point convergence of one-shot and self-consistent GW 
results: (1) The calculations of the trivial semimetallic 
phases can be considered converged with a 6x6x6 k- 
point set for the one-shot GW calculations and 4x4x4 
for the QSGW ones. (2) The semiconducting and topo¬ 
logical phases require much larger k-point meshes, but 
(3) the corresponding results converge systematically to¬ 
wards an extrapolated quadratic (linear) fit to the con¬ 
verged trivial semimetallic phases for the direct (indi¬ 
rect) band gap. We note that in the case of one-shot 
GW calculations, the discontinuity in the curves due to 
the k-point convergence of the topological phases does 
not appear at the TT transition of GW but at that of 
LDA, because of the use of the LDA mean-field system 
as the starting point for the one-shot GW quasiparticle 
correction. 










7 



(b) direct gap, QSGW 




da 


da 


FIG. 5. (Color online) LDA, GW, and QSGW' values of the direct and indirect band gaps of bulk Bi as a function of the ratio 
c/a for different k-point sets, but for a set of reduced convergence parameters, see text. 
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